Accurate marginalization range for missing data recognition

Authors

  • Sébastien Demange
  • Christophe Cerisara
  • Jean Paul Haton
Abstract

Missing data recognition has been proposed to increase the noise robustness of automatic speech recognition. This strategy relies on a spectrographic mask that gives information about the true clean speech energy of a corrupted signal. This information is then used to refine how the data are processed during the decoding step. We propose in this work a new mask that provides more information about the clean speech contribution than classical masks based on signal-to-noise ratio (SNR) thresholding. The proposed mask is described and compared to another missing data approach based on SNR thresholding. Experimental results show a significant word error rate reduction induced by the proposed approach. Moreover, the proposed mask outperforms the ETSI advanced front-end on the HIWIRE corpus.
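To make the classical baseline concrete, the following is a minimal sketch of an SNR-thresholded binary mask and of how a decoder might use it through bounded marginalization. It is not the mask proposed in the paper. The function names, the spectral-subtraction estimate of clean energy, the 0 dB threshold, the diagonal Gaussian state model, and the lower integration bound of 0 are all assumptions made for illustration.

```python
import math


def snr_threshold_mask(noisy, noise, threshold_db=0.0, floor=1e-10):
    """Binary mask: True where a time-frequency cell looks speech-dominated.

    noisy and noise are lists of frames holding spectral energies.
    Clean energy is estimated by plain subtraction (an assumption).
    """
    mask = []
    for noisy_frame, noise_frame in zip(noisy, noise):
        row = []
        for y, n in zip(noisy_frame, noise_frame):
            clean = max(y - n, floor)  # crude clean-speech estimate
            snr_db = 10.0 * math.log10(clean / max(n, floor))
            row.append(snr_db > threshold_db)
        mask.append(row)
    return mask


def gaussian_pdf(x, mean, std):
    z = (x - mean) / std
    return math.exp(-0.5 * z * z) / (std * math.sqrt(2.0 * math.pi))


def gaussian_cdf(x, mean, std):
    return 0.5 * (1.0 + math.erf((x - mean) / (std * math.sqrt(2.0))))


def frame_likelihood(frame, mask_row, means, stds, lower=0.0):
    """Likelihood of one frame under a diagonal Gaussian (hypothetical model).

    Reliable cells are scored with the density at the observed value.
    Unreliable cells are integrated over [lower, observed value], since the
    clean speech energy cannot exceed the noisy observation.
    """
    likelihood = 1.0
    for x, reliable, m, s in zip(frame, mask_row, means, stds):
        if reliable:
            likelihood *= gaussian_pdf(x, m, s)
        else:
            likelihood *= gaussian_cdf(x, m, s) - gaussian_cdf(lower, m, s)
    return likelihood


# Toy data: two frames, two frequency bands (made-up values).
noisy = [[10.0, 1.2], [3.0, 5.0]]
noise = [[1.0, 1.0], [2.0, 1.0]]
mask = snr_threshold_mask(noisy, noise)
print(mask)  # [[True, False], [False, True]]

like = frame_likelihood(noisy[0], mask[0], means=[9.0, 0.5], stds=[2.0, 1.0])
```

In this sketch the mask only says whether a cell is reliable, so every unreliable cell is integrated over the same wide range. A mask that carries more information about the clean speech contribution could, in principle, narrow that range per cell.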

Similar resources

Missing data techniques: Feature reconstruction

Automatic speech recognition (ASR) performance degrades rapidly when speech is corrupted with increasing levels of noise. Missing data techniques (MDT) constitute a family of methods that tackle noise robust speech recognition based on the so-called missing data assumption proposed in [1]. MDTs assume that (i) the noisy speech signal can be divided into speech-dominated (reliable) and noise-domin...

State based imputation of missing data for robust speech recognition and speech enhancement

Within the context of continuous-density HMM speech recognition in noise, we report on imputation of missing time-frequency regions using emission state probability distributions. Spectral subtraction and local signal-to-noise estimation based criteria are used to separate the present from the missing components. We consider two approaches to the problem of classification with missing data: ma...

On noise masking for automatic missing data speech recognition: A survey and discussion

Automatic speech recognition (ASR) has reached very high levels of performance in controlled situations. However, the performance degrades significantly when environmental noise occurs during the recognition process. Nowadays, the major challenge is to reach a good robustness to adverse conditions, so that automatic speech recognizers can be used in real situations. Missing data theory is a ver...

Missing-feature method for speaker recognition in band-restricted conditions

In this study, the missing-feature method is considered to address band-limited speech for speaker recognition. In an effort to mitigate possible degradation due to the general speaker independent model, a two-step reconstruction scheme is developed, where speaker class independent/dependent models are used separately. An advanced marginalization in the cepstral domain is proposed employing a h...

Handling derivative filterbank features in bounded-marginalization-based missing data automatic speech recognition

This paper extends the familiar missing-data bounded-marginalization technique from static to dynamic filterbank features for noise robust automatic speech recognition. Based on a well-known theorem from statistics, it is shown how the reliability of derivative filterbank features can be expressed in the form of a probability density function. As another contribution, the corresponding HMM state emis...

Publication date: 2007